Search results for "theory [gamma rays]"
showing 10 items of 976 documents
Alignment-free sequence comparison using absent words
2018
Sequence comparison is a prerequisite to virtually all comparative genomic analyses. It is often realised by sequence alignment techniques, which are computationally expensive. This has led to increased research into alignment-free techniques, which are based on measures referring to the composition of sequences in terms of their constituent patterns. These measures, such as $q$-gram distance, are usually computed in time linear with respect to the length of the sequences. In this paper, we focus on the complementary idea: how two sequences can be efficiently compared based on information that does not occur in the sequences. A word is an {\em absent word} of some sequence if it does not oc…
Integrative analysis of structural variations using short-reads and linked-reads yields highly specific and sensitive predictions.
2020
Genetic diseases are driven by aberrations of the human genome. Identification of such aberrations including structural variations (SVs) is key to our understanding. Conventional short-reads whole genome sequencing (cWGS) can identify SVs to base-pair resolution, but utilizes only short-range information and suffers from high false discovery rate (FDR). Linked-reads sequencing (10XWGS) utilizes long-range information by linkage of short-reads originating from the same large DNA molecule. This can mitigate alignment-based artefacts especially in repetitive regions and should enable better prediction of SVs. However, an unbiased evaluation of this technology is not available. In this study, w…
2016
We determine knotting probabilities and typical sizes of knots in double-stranded DNA for chains of up to half a million base pairs with computer simulations of a coarse-grained bead-stick model: Single trefoil knots and composite knots which include at least one trefoil as a prime factor are shown to be common in DNA chains exceeding 250,000 base pairs, assuming physiologically relevant salt conditions. The analysis is motivated by the emergence of DNA nanopore sequencing technology, as knots are a potential cause of erroneous nucleotide reads in nanopore sequencing devices and may severely limit read lengths in the foreseeable future. Even though our coarse-grained model is only based on …
The Potential Role of Direct and Indirect Contacts on Infection Spread in Dairy Farm Networks.
2017
Animals’ exchanges are considered the most effective route of between-farm infectious disease transmission. However, despite being often overlooked, the infection spread due to contaminated equipment, vehicles, or personnel proved to be important for several livestock epidemics. This study investigated the role of indirect contacts in a potential infection spread in the dairy farm network of the Province of Parma (Northern Italy). We built between-farm contact networks using data on cattle exchange (direct contacts), and on-farm visits by veterinarians (indirect contacts). We compared the features of the contact structures by using measures on static and temporal networks. We assessed the d…
DeepWAS: Multivariate genotype-phenotype associations by directly integrating regulatory information using deep learning
2020
Genome-wide association studies (GWAS) identify genetic variants associated with traits or diseases. GWAS never directly link variants to regulatory mechanisms. Instead, the functional annotation of variants is typically inferred by post hoc analyses. A specific class of deep learning-based methods allows for the prediction of regulatory effects per variant on several cell type-specific chromatin features. We here describe “DeepWAS”, a new approach that integrates these regulatory effect predictions of single variants into a multivariate GWAS setting. Thereby, single variants associated with a trait or disease are directly coupled to their impact on a chromatin feature in a cell type. Up to…
A Thermodynamic Model of Monovalent Cation Homeostasis in the Yeast Saccharomyces cerevisiae
2016
Cationic and heavy metal toxicity is involved in a substantial number of diseases in mammals and crop plants. Therefore, the understanding of tightly regulated transporter activities, as well as conceiving the interplay of regulatory mechanisms, is of substantial interest. A generalized thermodynamic description is developed for the complex interplay of the plasma membrane ion transporters, membrane potential and the consumption of energy for maintaining and restoring specific intracellular cation concentrations. This concept is applied to the homeostasis of cation concentrations in the yeast cells of S. cerevisiae. The thermodynamic approach allows to model passive ion fluxes driven by the…
The role of spatial structure in the evolution of viral innate immunity evasion: A diffusion-reaction cellular automaton model
2020
Most viruses have evolved strategies for preventing interferon (IFN) secretion and evading innate immunity. Recent work has shown that viral shutdown of IFN secretion can be viewed as a social trait, since the ability of a given virus to evade IFN-mediated immunity depends on the phenotype of neighbor viruses. Following this idea, we investigate the role of spatial structure in the evolution of innate immunity evasion. For this, we model IFN signaling and viral spread using a spatially explicit approximation that combines a diffusion-reaction model and cellular automaton. Our results indicate that the benefits of preventing IFN secretion for a virus are strongly determined by spatial struct…
FastaHerder2: Four Ways to Research Protein Function and Evolution with Clustering and Clustered Databases.
2016
The accelerated growth of protein databases offers great possibilities for the study of protein function using sequence similarity and conservation. However, the huge number of sequences deposited in these databases requires new ways of analyzing and organizing the data. It is necessary to group the many very similar sequences, creating clusters with automated derived annotations useful to understand their function, evolution, and level of experimental evidence. We developed an algorithm called FastaHerder2, which can cluster any protein database, putting together very similar protein sequences based on near-full-length similarity and/or high threshold of sequence identity. We compressed 50…
Assessing statistical significance in multivariable genome wide association analysis
2016
Motivation: Although Genome Wide Association Studies (GWAS) genotype a very large number of single nucleotide polymorphisms (SNPs), the data are often analyzed one SNP at a time. The low predictive power of single SNPs, coupled with the high significance threshold needed to correct for multiple testing, greatly decreases the power of GWAS. Results: We propose a procedure in which all the SNPs are analyzed in a multiple generalized linear model, and we show its use for extremely high-dimensional datasets. Our method yields P-values for assessing significance of single SNPs or groups of SNPs while controlling for all other SNPs and the family wise error rate (FWER). Thus, our method tests whe…
SpaceScanner: COPASI wrapper for automated management of global stochastic optimization experiments
2017
Abstract Motivation Due to their universal applicability, global stochastic optimization methods are popular for designing improvements of biochemical networks. The drawbacks of global stochastic optimization methods are: (i) no guarantee of finding global optima, (ii) no clear optimization run termination criteria and (iii) no criteria to detect stagnation of an optimization run. The impact of these drawbacks can be partly compensated by manual work that becomes inefficient when the solution space is large due to combinatorial explosion of adjustable parameters or for other reasons. Results SpaceScanner uses parallel optimization runs for automatic termination of optimization tasks in case…